The World of Computer Software

home *** CD-ROM | disk | FTP | other *** search

/ The World of Computer Software / The World of Computer Software.iso / ts1st20.zip / STATMEAS.INF < prev next >

Wrap

Text File | 1992-09-22 | 15KB | 378 lines

- 1 - Tue 22-September-1992 (All rights reserved) About TS1ST in General (Statistical univariate measures) ====================== Apply question mark ? with the program call for a brief description of a program. This package may be used and distributed freely for NON-COMMERCIAL, NON-INSTITUTIONAL, PRIVATE purposes, provided it is not changed in any way. ┌────────────────────────────────────────────────────────────────┐ │ For ANY other usage (such as use in a business enterprise or a │ │ university) or the full scale version contact the author for a │ │ personal or a site license. │ └────────────────────────────────────────────────────────────────┘ Please do not distribute any part of this package separately. Uploading to BBSes is encouraged. The registered version is strictly for the registrant only. Identical programs must NOT be running on more than one computer at a time. Site licensed programs must not be run outside the licensed site. The programs are under development. Comments and contacts are solicited. If you have any questions, please do not hesitate to use electronic mail for communication. InterNet address: ts@uwasa.fi (preferred) Bitnet address: SALMI@FINFUN The author shall not be liable to the user for any direct, indirect or consequential loss arising from the use of, or inability to use, any program or file howsoever caused. No warranty is given that the programs will work under all circumstances. Timo Salmi Professor of Accounting and Business Finance Faculty of Accounting and Industrial Management University of Vaasa P.O. BOX 297, SF-65101 Vaasa, Finland CONTENTS: 1. Acknowledgments 2. General Description 3. Formulae 4. The Other Statistical Programs in the Set 5. The Statistics Set for Other Computers 6. Release Notes for Statmeas 7. List of the files in the package - 2 - 1. ACKNOWLEDGEMENTS Useful help by the following colleagues in testing STATMEAS and discussing the statistical concepts is gratefully acknowledged: Associate Professor Martti Luoma, Lecturer Antti Kanto, and Assistant Markku Malkamäki. 2. GENERAL DESCRIPTION STATMEAS (Ver 2.0) STATistical MEASures program is part of the interactive statistical system by Timo Salmi. It is the first program in the set, and is intended for univariate statistical analysis. (For the other programs in the set see Chapter 4.) STAMEAS includes a built-in help system, which can be invoked by typing ? at any interactive question. Because of this built-in help, and the interactive nature of the program's user interface, no long-winding instructions have been included. (Who reads instructions anyhow?) The program calculates for a set of observations in accordance with the user's choice: 1) Standard statistical measures 2) Autocorrelations 3) Runs and 4) Draws frequency diagrams. The data can either be given from the keyboard or taken from a file. If the input is to be taken from a file it must first be prepared with some editor, or some word processor which includes an option for preparing ordinary ascii text. The data is given to the program in the following format: X1 X2 X3 !variable names (! denotes a comment) 3.56 6.32 -1.73 5.12 -4.21 9.18 14.2 5.11 0.31 END !END is optional in a file A missing item in an observation is marked by a hash (#). E.g. if the first item of the second observation were missing, the observation should be written as # -4.21 9.18 The items in an observation can be separated with blanks, as in the above, or with commas (,) e.g. 5.12,-4.21,9.18. The number of the intervening blanks is irrelevant, and can be customized for increased readability. Thus e.g. 5.12 -4.21 9.18 and 5.12 -4.21 9.18 are equivalent. A row can be continued using an ampersand (&). E.g. the variables could be given as X1 X2 & X3 Alternatively, * can be used instead of & as the continuation marker. Comments can be added to the input data. If ! appears on a line all text after ! will be considered as a comment. A header can be entered on each page if output is directed to a file. To accomplish this start the very first line on the input file with a double exclamation mark (!!) and the rest of the line will be used as the header. Thus !! indicates a header, a single ! an ordinary comment. The maximum number of variables is 25. The maximum number of observations is 400 (for each variable). The public domain version, however, sets the limits at 4 and 100 respectively. 3. FORMULAE This chapter gives a selected summary of the univariate statistics formulas used in STATMEAS. Denote N = number of observations (of the relevant variable) x(j) = value of the j'th observation (for the relevant variable) Mean kx = Σ x(j) / N ____________________ Standard deviation s = √ Σ (x(j)-kx)²/ (N-1) Variance s² = Σ (x(j)-kx)²/ (N-1) Coefficient of variation = standard deviation / mean 3 3 ½ Skewness g(1) = Σ (x(j)-kx) / (Ns ) standard error (6/N) 4 ½ Kurtosis g(2) = Σ (x(j)-kx) / (Ns ) - 3 standard error (24/N) N-lag Σ (x(j+lag)-kx)(x(j)-kx) j=1 Autocorrelation r(lag) = ---------------------------- N Σ (x(j)-kx)² j=1 _______________________ │ lag-1 Standard error std(r(lag)) = │ (1 + Σ 2 r(i)²) / N √ i=1 t-value = r(lag) / std(r(lag)) lag Ljung-Box Q = N(N+2) Σ r(i)² / (N - i) i=1 Run statistics: Denote n1 = number of negative changes n2 = number of no changes n3 = number of positive changes n4 = n1 + n2 + n3 expected number of runs = (n4(n4+1) - n1² - n2² - n3²) / N number of runs - expected number of runs + 0.5 Standardized k-value = ---------------------------------------------- std deviation of runs 4. THE OTHER STATISTICAL PROGRAMS IN THE SET STATMEAS is the first statistical program in a set of programs intended for analyzing statistical data. The second program in the set is STATREGR (STATistics: multiple REGRession), which also has been released. Plans include STATTRAN (STATistics TRANsformations) for effecting transformations (such as logarithms, differences, roots, etc.) on the data, and STATRCOR (STATistic Ranks and CORrelations) which is intended for parametric and non-parametric correlation analysis. STATTRAN, STATRCOR, and STATLADR have now been released to complete the set. Each of the programs appears in a different .zip file. The names are: TS1STxx.ZIP <-- STATMEAS TS2STxx.ZIP <-- STATREGR TS3STxx.ZIP <-- STATTRAN TS4STxx.ZIP <-- STATRCOR TS5STxx.ZIP <-- STATLADR 5. THE STATISTICS SET FOR OTHER COMPUTERS The Statistical programs by Timo Salmi are (were) also available for ill-fated the Sinclair QL computer. Named STATPREP the system is part of a Public Domain library for QUANTA members. The descriptions of the files in the Quanta Library are given below as taken directly from a QL database. The central part of the package has also been programmed for the VAX/VMS by the author, but he input and output commands of the VAX "REGRE" and "LADRE" programs are in Finnish, and the VAX versions are not public domain. (LADRE is a Least Absolute Deviation multiple REgression analysis.) In Quanta Library: STATPREP_DOC Quill documentation of Timo Salmi's statistical package. Associated programs STATPREP_EXE, STATTRAN_EXE, STATMEAS_EXE, STATRCOR_EXE, STATREGR_EXE, STATDIAG_EXE, and STATMENU. STATMENU Besides EXEC_W'ing the statistical package programs you can LRUN STATMENU and let it take care of the calls. Makes the package still easier to use. Alter the default device(s) (FDK1_) for your own configuration, if necessary. STATPREP_EXE (statistics preparation editor) An editor of a kind used for entering the data for the associated statistical programs. Maximum 10 variables with 200 observations (each). STATTRAN_EXE (transformations on variables) Used for making transformations on observations prepared with STATPREP_EXE. Transformations include taking logarithms, raising to power, taking roots, scaling, and multiplying between observations. Takes its input from a <statprep> file prepared by STATPREP_EXE and creates a new <statprep> file with the original and the transformed variables. STATMEAS_EXE (statistical measures) Standard statistical measures and frequency diagram for any of the variables in the data. Measures include minimum, maximum, mean, median, standard deviation, variance and kurtosis. STATRCOR_EXE (ranks and correlations) Calculates both the usual (Pearson's) correlation coefficients and Spearman's rank correlation coefficients. Also gives, on request, ranks for the observations of any of the variables. Note that computing Spearman's rank correlations takes quite awhile. STATAURU_EXE (autocorrelations and runs) Calculates autocorrelations up to the desired lag. Also calculates runs statistics. STATREGR_EXE (multiple regression) Full-scale multivariate regression on the data prepared with STATPREP_EXE (and STATTRAN_EXE). STATDIAG_EXE (scatter diagram) Draws scatter diagrams of the data prepared with STATPREP_EXE (and STATTRAN_EXE). The user has the choice of the X-axis and the Y-axis. If the user specifies a regression model (estimated by STATREGR_EXE), the predicted (i.e. regressed) values can be plotted on the scatter diagram. Furthermore, a scatter diagram of the residual can be drawn against any of the variables if the user has specified the regression model. 6. RELEASE NOTES FOR STATMEAS Versions 1.1 and 1.2 of STATMEAS include some minor changes and corrections in the user interface to make it fully compatible with the other programs in the set. Version 1.3: If the input file is not found, you have the choice of listing directories from within STATREGR. The directory routine has been rewritten for a more relaxed syntax. (For details see the information on DIRW in TSUTIL.INF in TSUTILxx.ZIP version 1.8 or later.) Version 1.4: An attempt to rewrite a protected file no longer crashes the program. Version 1.5: This version introduces input recall and line editing. The special keys Del, CrLf, CrRg, CrUp, Home, End, and Esc are functional for this purpose. PgUp is the recall key. Line editing uses insert mode. Disk access has been made faster (the program has its own cache). The directory routine has been updated. Read-only files can be read properly. Version 1.6: While fixing a bug in the multiple regression program I also made some enhancements in the routines which are common to all my statistical programs. Most importantly the insert key has been made functional in the line editing. Version 1.7: The line editing potential of the program has been improved. When the task is given from the keyboard, and continuation lines are used, the repeated input recall (CursorUp) gets each line in turn. Ctlr-C and break-key abort have been enhanced. A line can be continued using an &, or a * at the end. Now also the backslash \ is accepted. This was suggested by Tuomas Eerola. Some help texts within the program have been extended. Version 1.8: Several improvements to the nuts and bolts of the user interface. The new usage of the call is PROGNAME [/h(elp)] [/iInputFileName] [/oOutputFileName] [/cColumnsPerRow] (the /c option, which regulates the width of the output, is for registered versions, only). If you use the /i switch, it stuffs the InputFileName into the appropriate recall buffer. This means that when the program asks you for the input file name, you can invoke the input file name just by pressing the CursorUp key. (The same goes for the /o switch, respectively.) This is very convenient, if you use the program many times successively making small changes in your data in between. (This assumes, of course, that you have a command line editor like DOSEDIT or CED to recall previous MsDos commands. These common shareware programs can be obtained from any well-stocked BBS or FTP site.) The printer readiness test has been rewritten to be more general. The earlier test failed for some printers, because the codes the printers send when they are offline are not standardized. The "file exists, overwrite?" question is no more asked when the output file is prn, in other words when the output is directed to the printer. The user has now a choice of a left margin from 0 to 20 blanks when output is directed to the printer. The user has now a choice between formfeed and four blank lines to start each new page of output. When an input file is not found, the user is given the choice of listing a directory. The directory routine has been rewritten. The file ready message now also includes the file side besides the name. Version 1.9: The calculation of the coefficient of variation was added. Version 2.0: The input and output file names can be optionally given as parameters in the program calls, e.g. STATMEAS /ic:\stat\test.dat /or:\tmp This option has been improved. The "prefilled" name (e.g. c:\stat\test.dat) will now automatically appear on the input line without the need of pressing the cursor up key. All you need to do is to press enter. Also made some minor internal changes not worth recording. Rewrote the document files using a 68 column wrap instead of the former 80 to make the text easier to read and handle. Added the list of files in the package to the documentation. 7. LIST OF THE FILES IN THE PACKAGE TS1ST Statistics by T.Salmi, Part I Filename Comment -------- -------------------------------- FILE_ID.DIZ Brief characterization of TS1ST STATMEAS.EXE Statistical univariate measures STATMEAS.INF Document TSPROG.INF List of PD programs from T.Salmi VAASA.INF Info: Finland, Vaasa, U of Vaasa ---- ------ ------ ----- 0005